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1 Introduction 

The case for alternate gravity is easily made. The best that can be done from 
observing cosmic motions is to infer the metric g^ v in some coordinate system. 
From this one can reconstruct the Einstein tensor and then ask whether or not 
general relativity predicts it in terms of the observed sources of stress-energy, 

R^-\g^R) =8ttG(t / J ? (1) 

Z / rcc V / obs 

One way of explaining any disagreement is by positing the existence of an 
unobserved, "dark" component of the stress-energy tensor, 

W d , k -8^K-^) rcc -( T -) obs - < 2 > 

This always works, but recent observations make it seem epicyclic. 

The theory of nucleosynthesis implies that no more than about 4% of the 
energy density currently required to make general relativity agree with all 
observations can consist of any material with which we are presently familiar 

— and only a fraction of this 4% is observed. Just to make general relativity 
agree with the observed motions of galaxies and galactic clusters we must posit 
that six times the mass of ordinary matter comes in the form of nonbaryonic, 
cold dark matter [2]. Although there are some plausible candidates for what 
this might be, no Earth-bound laboratory has yet succeeded in detecting it. 

I belong to the minority of physicists who feel that this factor of six already 
strains credulity. Easing that strain is what led Milgrom to propose MOND 
, which can be viewed as a phenomenological modification of gravity in the 
regime of very small accelerations. There is an impressive amount of obser- 
vational data in favor of this modification 0] — although see Bekenstein 
has recently constructed a fully relativistic field theory jS] which reproduces 
MOND, and a preliminary analysis of the resulting cosmology works better 
than many experts thought possible [7|. 
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However, the worst problem for conventional gravity comes on the largest 
scales. To make general relativity agree with the Hubble plots of distant Type 
la supernovae :8. Qi llOj . with the power spectrum of anisotropies in the cosmic 
microwave background 11 and with large scale structure surveys j!2j . one 
must accept an additional component of "dark energy" that is about eighteen 
times larger than that of ordinary matter. This would mean that 96% of the 
current universe's energy exists in forms which have so far only been detected 
gravitationally! Even people who believe passionately in dark matter (and 
hence accept the factor of six) find this factor of 6+18 = 24 difficult to swallow. 
That is why there has been so much recent interest in modifying gravity to 
make it predict observed cosmic phenomena without the need for dark energy, 
and sometimes even without the need for dark matter. 

I want to stress that the issue is one of plausibility. There is no problem 
inventing field theories which give the required amount of dark energy. The 
simplest way of doing it is with a minimally coupled scalar ^] , 

C = -\d^ v g^^—g - V{<p)^ ■ (3) 

The usual procedure is to begin with a scalar potential V(ip) and work out 
the cosmology, but it is easy to start with whatever cosmological evolution is 
desired and construct the potential which would support it. I will go through 
the construction here, both to make the point and so that it can be used later. 

On the largest scales the geometry of the universe can be described in 
terms of a single function of time known as the scale factor a(t), 

ds 2 = -dt 2 + a 2 (t)dx ■ dx . (4) 

The logarithmic time derivative of this quantity gives the Hubble parameter, 

H(t) = * . (5) 
a 

If we specialize to a solution <po (t) of the scalar field equations which depends 
only upon time, the two nontrivial Einstein equations are, 

3# 2 = 8ttG(~^ + FM) , (6) 
-2H - 3H 2 = 8ttG(~^ - V(ip )) ■ (7) 

Let us assume a(t) is known as an explicit function of time, and construct 
<Po(t) and V(<p). By adding JJjJ and Q we obtain, 

-2H = 8nG(pl . (8) 

The weak energy condition implies H(t) < so we can take the square root 
and integrate to solve for (fio(t), 
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M o- m ±£«J=*M. ( 9 ) 

One can choose ipj and the sign freely. 

Because the integrand in @ is always positive, the function ipa(t) is mono- 
tonic. This means we can invert to solve for time as a function of ipo. Let us 
call the inverse function T(ip), 

^ = V o(T(V)). (10) 

By subtracting J7J) from |JBJ| we obtain a relation for the scalar potential as a 
function of time, 

V=^(H(t)+3H*(t)) . (11) 

The potential is determined as a function of the scalar by substituting the 
inverse function l|lUfl . 

V(<P) = +3i? 2 (TM)| . (12) 

This construction gives a scalar which supports any evolution a(t) (with 
H(t) < 0) all by itself. Should you wish to include some other, known compo- 
nent of the stress-energy, simply add the energy density and pressure of this 
component to the Einstein equations, 

3H 2 = 87rG(~<^ + V(ifio) + Pknown) , (13) 

-2H - 3H 2 = 8ttG(~$ - V(<p ) + Pknown ) . (14) 

Provided pknown and Pknown are known functions of either time or the scale 
factor, the construction goes through as before. 1 

Using this method one can devise a new field ip(x) which will support any 
cosmology with H(t) < 0. However, the introduction of such a "quintessence" 
field raises a number of questions: 

1. Where does tp reside in fundamental theory? 

2. Why can't <p couple to fields other than the metric? And if it does cou- 
ple to other fields, why haven't we detected its influence in Earth-bound 
laboratories? 

3. Why did if come to dominate the stress-energy of the universe so recently 
in cosmological time? 

4. Why is the <p field so homogeneous? 



1 This construction seems to be due to Ratra and Peebles 1141 . Recent examples of 
its use include [T51IT5in7| . 
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When a phenomenological fix raises more questions than it answers people 
are naturally drawn to investigate other fixes. One possibility is that general 
relativity is not the correct theory of gravity on cosmological scales. 
In this talk I shall review gravitational Lagrangians of the form, 

C-^R + ARb])^, (15) 

where is some local scalar constructed from the curvature tensor and 

possibly its covariant derivatives. Examples of such scalars are, 

^R af} R aP , \g»"R,vR, v , ^ 2 sm^R a ^R aPpa y (16) 

I begin by reviewing a powerful no-go theorem which pervades and constrains 
fundamental theory so completely that most people assume its consequence 
without thinking. This is the theorem of Ostrogradski who essentially 
showed why Newton was right to suppose that the laws of physics involve 
no more than two time derivatives of the fundamental dynamical variables. 
The key consequence for our purposes is that the only viable form for the 
functional Z\-R[g] in 1)15(1 is an algebraic function of the undifferentiated Ricci 
scalar, 

AR[g] = f{R) . (17) 

I review the Ostrogradski result in section 2, and hopefully immunize you 
against some common misconceptions about it in section 3. In section 4 I ex- 
plain why f(R) theories do not contradict Ostrogradski's result. I also demon- 
strate that, in the absence of matter, f(R) theories are equivalent to ordinary 
gravity, with f(R) = 0, plus a minimally coupled scalar of the form J3J). Then 

1 use the construction given above to show how one can choose f(R) to en- 
force an arbitrary cosmology. This establishes that an f(R) can be found to 
support any desired cosmology. In section 5 I discuss problems associated with 

4 

the particular choice function f(R) = — Section 6 presents conclusions. 

2 The Theorem of Ostrogradski 

Ostrogradski's result is that there is a linear instability in the Hamiltonians as- 
sociated with Lagrangians which depend upon more than one time derivative 
in such a way that the dependence cannot be eliminated by partial integration 
|18| . The result is so general that I can simplify the discussion by presenting 
it in the context of a single, one dimensional point particle whose position as 
a function of time is q(t). First I will review the way the Hamiltonian is con- 
structed for the usual case in which the Lagrangian involves no higher than 
first time derivatives. Then I present Ostrogradski's construction for the case 
in which the Lagrangian involves second time derivatives. And the section 
closes with the generalization to N time derivatives. 
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In the usual case of L = L(q, g), the Euler-Lagrange equation is, 

dL ddL 

ir q -dm= Q - (18) 

The assumption that ^ depends upon q is known as nondegeneracy. If the 
Lagrangian is nondegenerate we can write (|18f) in the form Newton assumed 
so long ago for the laws of physics, 

q = T{q,q) =>■ q(t) = Q{t, q , q ) . (19) 

From this form it is apparent that solutions depend upon two pieces of initial 
value data: q = <?(0) and q = ?(0). 

The fact that solutions require two pieces of initial value data means that 
there must be two canonical coordinates, Q and P. They are traditionally 
taken to be, 

dL 

Q = q and P ee — . (20) 
dq 

The assumption of nondegeneracy is that we can invert the phase space trans- 
formation 12U|) to solve for q in terms of Q and P. That is, there exists a 
function v(Q,P) such that, 



dL 
dq 



P. (21) 



The canonical Hamiltonian is obtained by Legendre transforming on q, 

H(Q,P) = Pq-L, (22) 

= Pv{Q,P)-l(q,v{Q,P)) ■ (23) 

It is easy to check that the canonical evolution equations reproduce the inverse 
phase space transformation (|21|l and the Euler-Lagrange equation i|18|) , 

• dH dv dL dv , . 

Q ^l)P= V + P dp-TqdP =V ' (M) 

dQ dQ dq dqdP dq ■ y ' 

This is what we mean by the statement, "the Hamiltonian generates time evo- 
lution." When the Lagrangian has no explicit time dependence, H is also the 
associated conserved quantity. Hence it is "the" energy by anyone's definition, 
of course up to canonical transformation. 

Now consider a system whose Lagrangian L(q, q, q) depends nondegener- 
ately upon q. The Euler-Lagrange equation is, 

dL_d_dL d^dL_„ 

~dq~ ~ Jt~dq~ + ~ ■ ( ' 
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Non-degeneracy implies that ^4 depends upon q, in which case we can cast 
H26|) in a form radically different from Newton's, 



<f- =F(q,q,q,q^) =► q(t) = Q(t,q ,q ,q ,q^) . (27) 



,(4) 



Because solutions now depend upon four pieces of initial value data there 
must be four canonical coordinates. Ostrogradski's choices for these are, 

„ dL d dL . . 

• P ^irq-dtirv (28) 

dL 

Q2 = q , P2 =ftj- ( 29 ) 

The assumption of nondegeneracy is that we can invert the phase space trans- 
formation H28I29|1 to solve for q in terms of Q\, Q 2 and P 2 . That is, there exists 
a function a(Qi, Q 2 , P2) such that, 



dL 



= P 2 . (30) 

q = Ql 

q=Q 2 

q = a 



Note that one only needs the function a(Q\,Q 2 , P 2 ) to depend upon three 
canonical coordinates — and not all four — because L(q,q,q) only depends 
upon three configuration space coordinates. This simple fact has great conse- 
quence. 

Ostrogradski's Hamiltonian is obtained by Legendre transforming, just as 
in the first derivative case, but now on q = q^ and q = q^ 2 \ 

2 

H(Q 1 ,Q 2 ,P 1 ,P 2 ) = Y / P i q^ -L, (31) 

i=l 

= P1Q2 + P2a(Qi,Q 2 ,P2) - L(Q 1 ,Q 2 ,a(Q 1 ,Q 2 ,P 2 )) . (32) 



The time evolution equations are just those suggested by the notation, 

dH dH 

~d~P l and P ^-dQl- ^ 
Let's check that they generate time evolution. The evolution equation for Qi? 

OH 

Qi = q Fi =Q2, (34) 

reproduces the phase space transformation q = Q 2 in (|29(l . The evolution 
equation for Q 2 , 

■ dH n da dL da /nr . 

l2 = — = a + P 2 —-——=a, (35) 



dP 2 *dP 2 dqdP 2 
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reproduces i|3U[l . The evolution equation for P 2 , 

dH da dL dL da dL , s 

p 2^7r + + ^7^tt =-Pi + ^r , (36) 



dQ 2 dQ 2 dq dq dQ 2 dq 

dL _ d dL 
dq dt dq 



reproduces the phase space transformation Pi = Ok — 4j^k (|28p. And the 



evolution equation for P\, 

b __ dH 9a dL dL da dL 

Pl ~ ~dQ~i ~ ~ P ' 2 dCh + ~dq + ~dq 7 dQ~i ~ ~dq ' (3?) 

reproduces the Euler-Lagrange equation l|26|) . So Ostrogradski's system re- 
ally does generate time evolution. When the Lagrangian contains no explicit 
dependence upon time it is also the conserved Noether current. By anyone's 
definition, it is therefore "the" energy, again up to canonical transformation. 

There is one, overwhelmingly bad thing about Ostrogradski's Hamiltonian 
l|32l) : it is linear in the canonical momentum P\ . This means that no system 
of this form can be stable. In fact, there is not even any barrier to decay. Note 
also the power and generality of the result. It applies to every Lagrangian 
L(q,q,q) which depends nondegenerately upon q, independent of the details. 
The only assumption is nondegeneracy, and that simply means one cannot 
eliminate q by partial integration. This is why Newton was right to assume 
the laws of physics take the form (|19|) when expressed in terms of fundamental 
dynamical variables. 

Adding more higher derivatives just makes the situation worse. Consider 
a Lagrangian L (g, q, . . . , q^) which depends upon the first iV derivatives of 
q(t). If this Lagrangian depends nondegenerately upon q( N ^ then the Euler- 
Lagrange equation, 

contains q^ 2N '. Hence the canonical phase space must have 2N coordinates. 
Ostrogradski's choices for them are, 

<W«> and (39, 

j=i 

Non-degeneracy means we can solve for q( N ' in terms of Pn and the Qi's. 
That is, there exists a function A(Q%, . . . , Qn, Pn) such that, 



dL 



d q W 



= Pn ■ (40) 



For general N Ostrogradski's Hamiltonian takes the form, 



8 R. P. Woodard 



N 

H = Y,Pi<l^-L, (41) 

= PiQ 2 + P 2 Q 3 + --- + P N -iQ n + P n A-l(q 1 ,...,Q n ,A) . (42) 
It is simple to check that the evolution equations, 

Ct<= m and * = (43) 

again reproduce the canonical transformations and the Euler-Lagrange equa- 
tion. So H42fl generates time evolution. Similarly, it is Noether current for the 
case where the Lagrangian contains no explicit time dependence. So there 
is little alternative to regarding l|42(l as "the" energy, again up to canonical 
transformation. 

One can see from Q42JI that the Hamiltonian is linear in Pi, P2, . . . Pjv-i- 
Only with respect to P/v might it be bounded from below. Hence the Hamil- 
tonian is necessarily unstable over half the classical phase space for large N\ 



3 Common Misconceptions 

The no-go theorem I have just reviewed ought to come as no surprise. It 
explains why Newton was right to expect that physical laws take the form 
of second order differential equations when expressed in terms of fundamen- 
tal dynamical variables. 2 Every fundamental system we have discovered since 
Newton's day has had this form. The bizarre, dubious thing would be if New- 
ton had blundered upon a tiny subset of possible physical laws, and all our 
probing over the course of the next three centuries had never revealed the 
vastly richer possibilities. However — deep sigh — particle theorists don't like 
being told something is impossible, and a definitive no-go theorem such as 
that of Ostrogradski provokes them to tortuous flights of evasion. I ought to 
know, I get called upon to referee the resulting papers often enough! No one 
has so far found a way around Ostrogradski's theorem. I won't attempt to 
prove that no one ever will, but let me use this section to run through some 
of the misconceptions which have been in back of attempted evasions. 

To fix ideas it will be convenient to consider a higher derivative general- 
ization of the harmonic oscillator, 



gm 2 m 2 



rauj 



2 



Here m is the particle mass, u> is a frequency and g is a small positive pure 
number we can think of as a coupling constant. The Euler-Lagrange equation, 



2 The caveat is there because one can always get higher order equations by solving 
for some of the fundamental variables. 
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m 



has the general solution, 

q(t) = A + cos{k + t) + B + sin(k + t) + A_ cos(fc_i) + B_ sm(k_t) . (46) 
Here the two frequencies are, 



(47) 



2.9 



and the initial value constants are, 



k\-k 2 _ ' fc_(fc2 _fc2) • ^ 



The conjugate momenta are, 

P l =mq + ^-W 3 > <S> g< 8 ) = 1 — , 50 

am cj 2 Pt 
P 2 = —^q «= -. 51 

The Hamiltonian can be expressed in terms of canonical variables, configura- 
tion space variables or initial value constants, 



it vn w% P 2 m 2 mu 2 2 

H = PiQ 2 - P 2 - 77^2 + —^-Qi > ( 52 ) 

2<?m 2 2 



ff m . (3) gm .. 2 m . 2 mw' 
.^Tgk\{A\ + BX) ^^r 9 k 2 _(A 2 _+B 2 _) . (54) 



y"" ■ 3 -2 , "* -2 i " lw 2 /r \ 



The last form makes it clear that the "+" modes carry positive energy whereas 
the "— " modes carry negative energy. 



3.1 Nature of the Instability 

It's important to understand both how the Ostrogradskian instability mani- 
fests and what is physically wrong with a theory which shows this instability. 
Because the Ostrogradskian Hamiltonian l|42|) is not bounded below with re- 
spect to more than one of its conjugate momenta, one sees that the problem 
is not reaching arbitrarily negative energies by setting the dynamical variable 
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to some constant value. Rather it is reaching arbitrarily negative energies by 
making the dynamical variable have a certain time dependence. People some- 
times mistakenly believe they have found a higher derivative system which is 
stable when all they have checked is that the Hamiltonian is bounded from 
below for constant field configurations. For example, from expression (|53|l we 
see that our higher derivative oscillator energy is bounded below by zero for 
q(t) — const! Negative energies are achieved by making q large and/or making 
q(3) i ar g e w hile keeping q+gq^ /lo 2 fixed. 

Another crucial point is that the same dynamical variable typically carries 
both positive and negative energy degrees of freedom in a higher derivative 
theory. For our higher derivative oscillator this is apparent from expression 
l|46l) which shows that q(t) involves both the positive energy degrees of free- 
dom, A + and B +1 and the negative energy ones, A_ and £?_. And note from 
expression (|54l) that I really mean positive and negative energy, not just pos- 
itive and negative frequency, which is the usual case in a lower derivative 
theory. 

People sometimes imagine that the energy of a higher derivative theory 
decays with time. That is not true. Provided one is dealing with a complete 
system, and provided there is no external time dependence, the energy of a 
higher derivative system is conserved, just as it would be under those con- 
ditions for a lower derivative theory. This conservation is apparent for our 
higher derivative oscillator from expression (|54|l . 

The physical problem with nondegenerate higher derivative theories is not 
that their energies decay to lower and lower values. The problem is rather 
that certain sectors of the theory become arbitrarily highly excited when one 
is dealing with an interacting, continuum field theory which has nondegenerate 
higher derivatives. To understand this I must digress to remind you of some 
familiar facts about the Hydrogen atom. 

If you consider Hydrogen in isolation, there is an infinite tower of sta- 
tionary states. However, if you allow the Hydrogen atom to interact with 
electromagnetism only the ground state is stationary; all the excited states 
decay through the emission of a photon. Why is this so? It certainly is not 
because "the system wants to lower its energy." The energy of the full system 
is constant, the binding energy released by the decaying atom being compen- 
sated by the energy of the recoil photon. Yet the decay always takes place, 
and rather quickly. The reason is that decay is terrifically favored by entropy. 
If we prepare the Hydrogen atom in an excited state, with no photons present, 
there is one way for the atom to remain excited, whereas there are an infinite 
number of ways for it to decay because the recoil photon could go off in any 
direction. 

Now consider an interacting, continuum field theory which possesses the 
Ostrogradskian instability. In particular consider its likely particle spectrum 
about some "empty" solution in which the field is constant. Because the 
Hamiltonian is linear in all but one of the conjugate momenta we can in- 
crease or decrease the energy by moving different directions in phase space. 
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Hence there must be both positive energy and negative energy particles - 
just as there are in our higher derivative oscillator. Just as in that point parti- 
cle model, the same continuum field must carry the creation and annihilation 
operators of both the positive and the negative energy particles. If the the- 
ory is interacting at all — that is, if its Lagrangian contains a higher than 
quadratic power of the field — then there will be interactions between posi- 
tive and negative energy particles. Depending upon the interaction, the empty 
state can decay into some collection of positive and negative energy particles. 
The details don't really matter, all that matters is the counting: there is one 
way for the system to stay empty versus a continuous infinity of ways for it 
to decay. This infinity is even worse than for the Hydrogen atom because it 
includes not only all the directions that recoil particles of fixed energies could 
go but also the fact that the various energies can be arbitrarily large in mag- 
nitude provided they sum to zero. Because of that last freedom the decay is 
instantaneous. And the system doesn't just decay once! It is even more en- 
tropicly favored for there to be two decays, and better yet for three, etc. You 
can see that such a system instantly evaporates into a maelstrom of positive 
and negative energy particles. Some of my mathematically minded colleagues 
would say it isn't even defined. I prefer to simply observe that no theory of 
this kind can describe the universe we experience in which all particles have 
positive energy and empty space remains empty. 

Note that we only reach this conclusion if the higher derivative theory pos- 
sesses both interactions and continuum particles. Our point particle oscillator 
has no interactions, so its negative energy degree of freedom is harmless. Of 
course it is also completely unobservable! However, it is conceivable we could 
couple this higher derivative oscillator to a discrete system without engen- 
dering an instability. The feature that drives the instability when continuum 
particles are present is the vast entropy of phase space. Without that it be- 
comes an open question whether or not there is anything wrong with a higher 
derivative theory. Of course we live in a continuum universe, and any degree 
of freedom we can observe must be interacting, so these are very safe assump- 
tions. However, people sometimes delude themselves that there is no problem 
with continuum, interacting higher derivative models of the universe on the 
basis of studying higher derivative systems which could never describe the 
universe because they either lack interactions or else continuum particles. 

In this sub-section we have learned: 

1. The Ostrogradskian instability does not drive the dynamical variable to 
a special, constant value but rather to a special kind of time dependence. 

2. A dynamical variable which experiences the Ostrogradskian instability 
will carry both positive and negative energy creation and annihilation 
operators. 

3. If the system interacts then the "empty" state can decay into a collection 
of positive and negative energy excitations. 
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4. If the system is a continuum field theory the vast entropy at infinite mo- 
mentum will make the decay instantaneous. 



3.2 Perturbation Theory 

People sometimes mistakenly believe that the Ostrogradskian instability is 
avoided if higher derivatives are segregated to appear only in interaction terms. 
This is not correct if one considers the theory on a fundamental level. One can 
see from the construction of section [5] that the fact of Ostrogradski's Hamilto- 
nian being unbounded below depends only upon nondegeneracy, irrespective 
of how one organizes any approximation technique. However, there is a way 
of imposing constraints to make the theory agree with its perturbative devel- 
opment. If this is done then there are no more higher derivative degrees of 
freedom, however, one typically loses unitarity, causality and Lorentz invari- 
ance on the nonperturbative level. 

I constructed the higher derivative oscillator Q44|l so that its higher deriva- 
tives vanish when g = 0. If we solve the Euler-Lagrange equation l|45|l exactly, 
without employing perturbation theory, there are four linearly independent 
solutions (|46|) corresponding to a positive energy oscillator of frequency k + 
and a negative energy oscillator of frequency /c_. However, we might instead 
regard the parameter g as a coupling constant and solve the equations per- 
turbatively. This means substituting the ansatz, 

oo 

QpcrtW = I>" X "W , (55) 
n=0 

into the Euler-Lagrange equation (|45|l and segregating terms according to 
powers of g. The resulting system of equations is, 

x + uj 2 x Q = , (56) 
x\ +lo 2 x\ = ^x^ , (57) 

x 2 +lo 2 x 2 = -\x{ 4) , (58) 

and so on. Because the zeroth order equation involves only second derivatives, 
its solution depends upon only two pieces of initial value data, 



xo(t) = qo cos(wi) + — sin(wi) . (59) 
to 



The first correction is, 



x i( f ) = --^Qosin(ut) + ^qocos(wt) - ^—q Q sin(u;t) , (60) 



and it is easy to see that the sum of all corrections gives, 
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? P ert(*) = qo cos(fc+<) + sin(fc+i) . (61) 

k+ 

What is the relation of the perturbative solution (|61|) to the general one 
1461) ? The perturbative solution is what results if we change the theory by 
imposing the constraints, 

q(t) = -k 2 +q(t) P 2 = ^(l-V 7 ! 3 ^)^! . (62) 

3< 3) (t) = Pi = y(n-V^ Z 4i)02- (63) 

Under these constraints the Hamiltonian becomes, 

, /m o rak 2 , „\ 

ffpcrt = VT^( ^Ql + -^Ql) - (64) 

which is indeed that of a single harmonic oscillator. From the full theory, 
perturbation theory has retained only the solution whose frequency is well 
behaved for g — ► 0, 

k+=u:(l+ l -g + I g 2 + 0(. 9 3 )) . (65) 
It has discarded the solution whose frequency blows up as g — ■> 0, 

*- = ^(1-^-^ + 0(9")). (66) 

So what's wrong with this? In fact there is nothing wrong with the pro- 
cedure for our model. If the constraints I|62I63|I arc imposed at one instant, 
they remain valid for all times as a consequence of the full equation of motion. 
However, that is only because our model is free of interactions. Recall that 
this same feature means the positive and negative energy degrees of freedom 
exist in isolation of one another, and there is no decay to arbitrarily high 
excitation as there would be for an interacting, continuum field theory. 

When interactions are present it is more involved but still possible to 
impose constraints which change the theory so that only the lower derivative, 
perturbative solutions remain. The procedure was first worked out by Jaen, 
Llosa and Molina [23, and later, independently, by Eliezer and me [20] - T° 
understand its critical defect suppose we change the "interaction" of our higher 
derivative oscillator from a quadratic term to a cubic one, 

gm „ 2 gm 3 

Here £ is some constant with the dimensions of a length. As with the quadratic 
interaction, the new equation of motion is fourth order, 



<P ( 9<i 2 \ ■■ 2 



, (68) 
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Its general solution depends upon four pieces of initial value data. However, 
by isolating the highest derivative term of the free theory, 




and then iteratively substituting (|69|l . we can delay the appearance of higher 
derivatives on the right hand side to any desired order in the coupling constant 
g. For example, two iterations frees the right hand side of higher derivatives 
up to order g 2 , 




This obviously becomes complicated fast! However, the lower derivative terms 
at order g 2 are simple enough to give if I don't worry about the higher deriva- 
tive remainder, 

q= -<A+§ + 9 ^(-6ujV + U qq 2 ) +0(g 3 ) . (72) 

If we carry this out to infinite order, and drop the infinite derivative remainder, 
the result is an equation of the traditional form, 

$ = /(?,«)• (73) 

The canonical version of this equation gives the first of the desired constraints. 
The second is obtained from the canonical version of its time derivative. 

The constrained system we have just described is consistent on the per- 
turbative level, but not beyond. It does not follow from the original, exact 
equation. That would be no problem if we could define physics using pertur- 
bation theory, but we cannot. Perturbation theory does not converge for any 
known interacting, continuum field theory in 3 + 1 dimensions! The fact that 
the constraints are not consistent beyond perturbation theory means there is 
a nonperturbative amplitude for the system to decay to the arbitrarily high 
excitation in the manner described in sub-section 13. II The fact that the con- 
straints treat time derivatives differently than space derivatives also typically 
leads to a loss of causality and Lorentz invariance beyond perturbation theory. 

A final comment concerns the limit of small coupling constant, i.e., g — > 0. 
One can see from the frequencies <|65I66[1 of our higher derivative oscillator that 
the negative energy frequency diverges for g — > 0. Disingenuous purveyors of 
higher derivative models sometimes appeal to people's experience with positive 
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energy modes by arguing that, "the k_ mode approaches infinite frequency 
for small coupling so it must drop out." That is false! The argument is quite 
correct for an infinite frequency positive energy mode in a stable theory. In 
that case exciting the mode costs an infinite amount of energy which would 
have to be drawn from de-exciting finite frequency modes. However, a negative 
energy mode doesn't decouple as its frequency diverges. Rather it couples 
more strongly because taking its frequency to infinity opens up more and 
more ways to balance its negative energy by exciting finite frequency, positive 
energy modes. 



3.3 Quantization 

People sometimes imagine that quantization might stabilize a system against 
the Ostrogradskian instability the same way that it does for the Hydrogen 
atom coupled to electromagnetism. This is a failure to understand correspon- 
dence limits. Conclusions drawn from classical physics survive quantization 
unless they depend upon the system either being completely excluded from 
some region of the canonical phase space or else inhabiting only a small region 
of it. For example, the classical instability of the Hydrogen atom (when cou- 
pled to electromagnetism) derives from the fact that the purely Hydrogenic 
part of the energy, 

~ -*n~ ~ p| ' (74) 

can be made arbitrarily negative by placing the electron close to the nucleus 
at fixed momentum. Because this instability depends upon the system being 
in a very small region of the canonical phase space, one might doubt that it 
survives quantization, and explicit computation shows that it does not. 

In contrast, the Ostrogradskian instability derives from the fact that P\Qi 
can be made arbitrarily negative by taking Pj either very negative, for positive 
Q2, or else very positive, for negative Q%. This covers essentially half the 
classical phase space! Further, the variables Q2 and P\ commute with one 
another in Ostrogradskian quantum mechanics. So there is no reason to expect 
that the Ostrogradskian instability is unaffected by quantization. 



3.4 Unitarity vs. Instability 

Particle physicists who quantize higher derivative theories don't typically rec- 
ognize a problem with the stability. They maintain that the problem with 
higher derivatives is a breakdown of unitarity. In this sub-section I will again 
have recourse to the higher derivative oscillator Q44JI to explain the connection 
between the two apparently unrelated problems. 

Let us find the "empty" state wavefunction, f2(Qi,Q2) that has the min- 
imum excitation in both the positive and negative energy degrees of freedom. 
The procedure for doing this is simple: first identify the positive and negative 
energy lowering operators a± and then solve the equations, 
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a+\Q) = = a_\f2) 



(75) 



We can recognize the raising and lowering operators by simply expressing the 
general solution l|46|l in terms of exponentials, 

q(t) = i(A + +* J B + )e-^ t + i(A + - 4 S + )e lfc +* 

+-(A_ +iB_)e- lk - t + ^{A_-iB_)e ik - 1 . (76) 

Recall that the k+ mode carries positive energy, so its lowering operator must 
be proportional to the e~ tk+t term, 



a + r~j A + + iB + , 



(77) 



(l + v/l-4 5 ) Qi + iPi - k+P 2 - ™ (l - y/l-Ag) Q 2 ■ (78) 



The k^ mode carries negative energy, so its lowering operator must be pro- 
portional to the e +lk - 1 term, 



A- - iB- , 
mk 



(79) 



l-Vl-4flj0i - iPi - k-P 2 + — • (80) 

Writing Pj = —i-^- we see that the unique solution to (|75|) has the form, 



f2(Q 1 ,Q 2 )=Nexp 



(k + k-Ql + Qfj - i y /gmQ 1 Q 2 



2(fc+ + fc_) 



(81) 



The empty wave function (|81|) is obviously normalizable, so it gives a state 
of the quantum system. We can build a complete set of normalized stationary 
states by acting arbitrary numbers of + and — raising operators on it, 



x {a\_W (a f ) N 
\N+,N-) = +J+ V 



\n) 



(82) 



On this space of states the Hamiltonian operator is unbounded below, just as 
in the classical theory, 



H\N + ,N_) = [N + k+ - N-k-)\N + ,N. 



(83) 



This is the correct way to quantize a higher derivative theory. One evidence 
of this fact is that classical negative energy states correspond to quantum 
negative energy states as well. 

Particle physicists don't quantize higher derivative theories as we just have. 
What they do instead is to regard the negative energy lowering operator as 
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a positive energy raising operator. So they define a "ground state" \Q) which 
obeys the equations, 

a + \T2) = = al\Q) . (84) 
The unique wave function which solves these equations is, 



n(Q 1 ,Q 2 )=Nexp 



2(/c_-fc+) 



(85) 



This wave function is not normalizable, so it doesn't correspond to a state of 
the quantum system. At this stage we should properly call a halt to the anal- 
ysis because we aren't doing quantum mechanics anymore. The Schrodinger 
equation Hijj(Q) = Eip(Q) is just a second order differential equation. It has 
two linearly independent solutions for every energy E: positive, negative, real, 
imaginary, quaternionic — it doesn't matter. The thing that puts the "quan- 
tum" in quantum mechanics is requiring that the solution be normalizable. 
Many peculiar things can happen if we abandon allow normalizability |21II22| . 

However, my particle theory colleagues ignore this little problem and define 
a completely formal "space of states" based upon | J?) , 



\N+,N-) = v t_ v . (86) 

None of these wavefunctions is any more normalizable than Q2), so not 

a one of them corresponds to a state of the quantum system. However, they 
are all positive energy eigenfunctions, 

H\N+,N-) = (N+k+ + N-kS) \N+,N-) . (87) 

My particle physics colleagues typically say they define \Q) to have unit norm. 
Because they have not changed the commutation relations, 

[a+,4]=l = [a_,al], (88) 

the norm of any state with odd iV_ is negative! The lowest of these is, 

(D7TI07I) = <72|Q;LQi_]72> = -<1?|77> . (89) 

As I pointed out above, the reason this has happened is that we aren't do- 
ing quantum mechanics any more. We ought to use the normalizable, but 
indefinite energy eigenstates. What particle physicists do instead is to reason 
that because the probabilistic interpretation of quantum mechanics requires 
norms to be positive, the negative norm states must be excised from the space 
of states. At this stage good particle physicists note that that the result- 
ing model fails to conserve probability |23| . Just as the correctly-quantized, 
indefinite-energy theory allows processes which mix positive and negative en- 
ergy particles, so too the indefinite-norm theory allows processes which mix 
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positive and negative norm particles. It only conserves probability on the space 
of "states" which includes both kinds of norms. If we excise the negative norm 
states then probability is no longer conserved. 

So good particle physicists reach the correct conclusion — that nondegen- 
erate higher derivative theories can't describe our universe — by a somewhat 
illegitimate line of reasoning. But who cares? They got the right answer! Of 
course bad particle physicists regard the breakdown of unitarity as a challenge 
for inspired tinkering to avoid the problem. Favorite ploys are the Lee- Wick 
reformulation of quantum field theory 24 and nonperturbative resumma- 
tions. The analysis also typically involves the false notion that high frequency 
ghosts decouple, which I debunked at the end of sub-section 13. 21 When the fi- 
nal effort is written up and presented to the world, some long-suffering higher 
derivative expert gets called away from his research to puzzle out what was 
done and explain why it isn't correct. Sigh. The problem is so much clearer 
in its negative energy incarnation! I could list many examples at this point, 
but I will confine myself to citing a full-blown paper debunking one of them 
[2S] . It is also appropriate to note that Hawking and Hertog have previously 
called attention to the mistake of quantizing higher derivative theories using 
nonnormalizable wave functions |2fi| . 



3.5 Constraints 

The only way anyone has ever found to avoid the Ostrogradskian instabil- 
ity on a nonperturbative level is by violating the single assumption needed 
to make Ostrogradski's construction: nondegeneracy. Higher derivative theo- 
ries for which the definition of the highest conjugate momentum (|4t)(l cannot 
be inverted to solve for the highest derivative can sometimes be stable. An 
interesting example of this kind is the rigid, relativistic particle studied by 
Plyushchay [27|EB1 

Degeneracy is of great importance because all theories which possess con- 
tinuous symmetries are degenerate, irrespective of whether or not they possess 
higher derivatives. A familiar example is the relativistic point particle, whose 
dynamical variable is X^(t) and whose Lagrangian is, 



L = -my -r}^X»X» . (90) 
The conjugate momentum is, 

„ mX n 

P„ = -7== • (91) 

V-x 2 

Because the right hand side of this equation is homogeneous of degree zero 
one can not solve for X 11 . The associated continuous symmetry is invariance 
under reparameterizations r — > t'(t), 

X"(t) — » X'"(t) = A^(t' _1 (t)) . (92) 
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The cure for symmetry-induced degeneracy is simply to fix the symmetry 
by imposing gauge conditions. Then the gauge-fixed Lagrangian should no 
longer be degenerate in terms of the remaining variables. For example, we 
might parameterize so that r = X (r), in which case the gauge-fixed particle 
Lagrangian is, 

L GF = -mVl-X-X . (93) 
In this gauge the relation for the momenta is simple to invert, 

Pj= , "*« ^ X*= - ^ . (94) 

\/l-X-X Vm 2 + P ■ P 

When a continuous symmetry is used to eliminate a dynamical variable, 
the equation of motion of this variable typically becomes a constraint. For sym- 
metries enforced by means of a compensating field — such as local Lorentz 
invariance is with the antisymmetric components of the vierbein |29j — the 
associated constraints are tautologies of the form = 0. Sometimes the con- 
straints are nontrivial, but implied by the equations of motion. An example 
of this kind is the relativistic particle in our synchronous gauge. The equation 
of the gauge-fixed zero-component just tells us the Hamiltonian is conserved, 

mXn 




X»X V , 



= 0— ► — ^m 2 +p-pj = 0. (95) 



And sometimes the constraints give nontrivial relations between the canonical 
variables that generate residual, time-independent symmetries. In this case 
another degree of freedom can be removed ("gauge fixing counts twice," as 
van Nieuwcnhuizcn puts it). An example of this kind of constraint is Gauss' 
Law in temporal gauge electrodynamics. 

Were it not for constraints of this last type, the analysis of a higher deriva- 
tive theory with a gauge symmetry would be straightforward. One would sim- 
ply fix the gauge and then check whether or not the gauge-fixed Lagrangian 
depends nondegenerately upon higher time derivatives. If it did, the conclu- 
sion would be that the theory suffers the Ostrogradskian instability. However, 
when constraints of the third type are present one must check whether or 
not they affect the instability. This is highly model dependent but a very 
simple rule seems to be generally applicable: if the number of gauge con- 
straints is less than the number of unstable directions in the canonical phase 
space then there is no chance for avoiding the problem. Because the number 
of constraints for any symmetry is fixed, whereas the number of unstable di- 
rections increases with the number of higher derivatives, one consequence is 
that gauge constraints can at best avoid instability for some fixed number 
of higher derivatives. For example, the constraints of the second derivative 
model of Plyushchay are sufficient to stabilize the system |271 128|. but one 
would expect it to become unstable if third derivatives were added. 
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People sometimes make the mistake of believing that the Ostrogradskian 
instability can be avoided with just a single, global constraint on the Hamil- 
tonian. For example, Boulware, Horowitz and Strominger |3(J| showed the 
energy is zero for any asymptotically flat solution of the higher derivative 
field equations derived from the Lagrangian, 

C = aR 2 ^j + PR^R^^f^g . (96) 

As I explained in sub-section 13. II the nature of the Ostrogradskian instability 
is not that the energy decays but rather that the system evaporates to a very 
highly excited state of compensating, positive and negative energy degrees 
of freedom. As long as ^ 0, there are six independent, higher derivative 
momenta at each space point, whereas there are only four local constants — 
or five if a and /3 are such as to give local conformal invariance. Hence there 
are two (or one) unconstrained instabilities per space point. There are an 
infinite number of space points, so the addition of a single, global constraint 
does not change anything. I should point out that Boulware, Horowitz and 
Strominger were aware of this, cf. their discussion of the dipole instability. 

The case of (3 = is special, and significant for the next section. If a has 
the right sign that model has long been known to have positive energy |.'illl32| . 
This result in no way contradicts the previous analysis. When (3 = the terms 
which carry second derivatives are contracted in such way that only a single 
component of the metric carries higher derivatives. So now the counting is 
one unstable direction per space point versus four local constraints. Hence 
the constraints can win, and they do if a has the right sign. 

3.6 Nonlocality 

I would like to close this section by commenting on the implications of Ostro- 
gradski's theorem for fully nonlocal theories. In addition to nonlocal quantum 
field theories C2J OH this is relevant to string field theory [3(3 123 EU , to 
noncommutative geometry |3!5] EH]j to regularization techniques [411 1421 |4"3] 
and even to theories of cosmology |15H44ll43] , The issue in each case is whether 
or not we can think of the fully nonlocal theory as the limit of a sequence of 
ever higher derivative theories. When such a representation is possible the 
nonlocal theory must inherit the Ostrogradskian instability. 

The higher derivative representation is certainly valid for string field theory 
because, otherwise, there would be cuts and poles that would interfere with 
perturbative unitarity. So string field theory suffers from the Ostrogradskian 
instability [20| . The same is true for theories where the nonlocality is of limited 
extent in time J^Sj , although not everyone agrees However, when the 

nonlocality involves inverse differential operators there need be no problem 
(20] Indeed, the effective action of any quantum field theory is nonlocal in 
this way 49, 50 ! Nor is there necessarily any problem when the nonlocality 
arises in the form of algebraic functions of local actions 51 . 
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4 AR[g] = f(R) Theories 

From the lengthy argumentation of the previous two sections one might con- 
clude that the only potentially stable, local modification of gravity is a cos- 
mological constant, Z\i?[.g] = —2A. However, a close analysis of sub-section 
13.51 reveals that it is also possible to consider algebraic functions of the Ricci 
scalar. In this section I first explain why such theories can avoid the Ostro- 
gradskian instability. I then demonstrate that they are equivalent to general 
relativity with a minimally coupled scalar, provided we ignore matter. Finally, 
I exploit this equivalence, with the construction described in the Introduction, 
to show how f(R) can be chosen to enforce any evolution a(t). 

4.1 Why They Can Be Stable 

The alert reader will have noted that the R + R 2 model [311 152) avoids the 
Ostrogradskian instability. It does this by violating Ostrogradski's assumption 
of nondegeneracy: the tensor indices of the second derivative terms in the 
Ricci scalar are contracted together so that only a single component of the 
metric carries higher derivatives. This component does acquire a new, higher 
derivative degree of freedom, and the energy of this degree of freedom is indeed 
opposite to that of the corresponding lower derivative degree of freedom, just 
as required by Ostrogradski's analysis. However, that lower derivative degree 
of freedom is the Newtonian potential. It carries negative energy, but it is also 
completely fixed in terms of the other metric and matter fields by the goo 
constraint. So the only instability associated with it is gravitational collapse. 
Its higher derivative counterpart has positive energy, at least on the kinetic 
level; it can still have a bad potential, and the model is indeed only stable for 
one sign of the R 2 term. 

None of these features depended especially upon the higher derivative term 
being R 2 . Any function for the Ricci scalar would work as well. Note that we 
cannot allow derivatives of the Ricci scalar, because Ostrogradski's theorem 
says the next higher derivative degree of freedom would carry negative energy 
and there would be no additional constraints to protect it. We also cannot 
permit more general contractions of the Ricmann tensor because then other 
components of the metric would carry higher derivatives. These components 
are positive energy in general relativity, so their higher derivative counterparts 
would be negative, and there would again be no additional constraints to 
protect the theory against instability. 

4.2 Equivalent Scalar Representation 

The general Lagrangian we wish to consider takes the form, 



(97) 
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If we ignore the coupling to matter the modified gravitational field equation 
consists of the vanishing of the following tensor, 

1(,;y(t ,)S [i+rmR^-\[R+fm9^+9Af\R)r P -[f'{R)]^. m 



7=5 5g^ 

There is an old procedure for reformulating this as general relativity with a 
minimally coupled scalar. I don't know whom to credit, but I will give the 
construction. 

The first step is to define an "equivalent" theory with an auxiliary field <f> 
which is defined by the relation. 



= ! + /'(#) 



r = n{<j>) 



(99) 



Inverting the relation determines the Ricci scalar as an algebraic function of <fi. 
We can then define an auxiliary potential for tf> by Legendre transformation, 



u{ct>) = {j>-i)n{cj>) -f{n{ct>) 



=J- U\4>) = 71(0) . (100) 
Now consider the equivalent scalar-tensor theory whose Lagrangian is, 

Ce^j^^R-U^))^. (101) 



Its field equations are, 
16ttG SS e 



V~9 °<P 
16ttG SS e 



= R-U'(4>) = 



(102) 



<j>R 



- t)(j/ll , l(cl>R-U( ( t>))g IM u+9^4>' p p -^u=0. (103) 

The scalar equation (|102|l implies 4>— l+f'(R), whereupon the tensor equations 
l|103(l reproduce the original modified gravity equations 

The final step is to define a new metric g^ and a new scalar <p by the 
change of variables, 



9tiv 



4ttG 



exp 



4ttG 



■ tp 



exp 



4ttG 



In terms of these variables the equivalent Lagrangian takes the form, 



C E = 



1 



;R 



16ttG 

where the scalar potential is, 



9- ^(pd^g^ 



9 • 



V(<p) 



1 



16ttG 



U\ exp 



r UnG " 




r /16ttG I 


IV 3 V . 


J exp 


. V 3 H 



(104) 
(105) 

(106) 
(107) 



This is general relativity with a minimally coupled scalar, as claimed. 
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4.3 Reconstructing f(R) from Cosmology 

I want to show how to choose f(R) to support an arbitrary a(t). 3 Recall 
from the Introduction that one can choose the potential of a quintessence 
model such as (|10fif) to support any homogeneous and isotropic cosmology 
for its metric g^v However, we cannot immediately exploit this construction 
because it is the metric g^ u which is assumed known, not . We must explain 
how to infer the one from the other without knowing f(R). 

Because the relation <|104f) between g^ v and g^ is a conformal transfor- 
mation, it makes sense to work in a coordinate system in which each metric 
is conformal to flat space. This is accomplished by changing from co-moving 
time t to conformal time r\ though the relation, dr\ — dt/a(t), 

ds 2 = -dt 2 + a 2 (t)dx ■ dx = a 2 (~dr) 2 + dx ■ dx) . (108) 



The g^v element takes the same form in conformal coordinates, but note that 
its different scale factor implies a different co-moving time, 



a 



(-drf + dx ■ dxj = -dt 2 + d 2 (t )dx ■ dx . (109) 



From relation (jl()4fl we infer, 

o(t) = a(t)exp[-J^tpo(t)\ • (HO) 

We denote differentiation with respect to r\ by a prime, and one should 
note the relation between derivatives with respect to the various times, 

d d „d 

— = a— = a— . HI 

dr] dt dt V ' 

Differentiating the logarithm of (|110f> with respect to r\ and using the relation 
JHJ between a and ipo gives, 



a' a' ttG , a' / 1 , „, 

a a V o a \ 12 

This is a nonlinear but first order differential equation for the variable a in 
terms of the known function, a(t(r])). At the worst it can be solved numerically. 

Once we have a the potential V(ip) can be constructed using the procedure 
explained in the Introduction. We then compute the auxiliary potential, 

U{4>) = 16nG^ 2 v[^^ ln(0)) . (113) 



3 For a somewhat different construction which achieves the same end, see |17II52| . 
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The auxiliary field can be expressed in terms of the Ricci scalar from the 
algebraic relation, 

U'{4>) = R (j)= $(R) . (114) 

And we finally recover the function f(R) by Legrendre transformation, 

f(R) = (${R)-\)R - U($(R)) . (115) 



5 Problems with f(R) = -t£ 

In view of the construction of sub-section 14.31 it is not surprising but rather 
inevitable that an f(R) can be found to support late time acceleration, or 
indeed, any other evolution. However, the method is not guaranteed to pro- 
duce a simple model, so the discovery that f(R) = —/j, 4 /R works is quite 
noteworthy 53 , 5|j . 4 It may also be significant that models of this type seem 
to follow from fundamental theory |56) . 

To derive acceleration in this model consider its field equations, 

4 i 4 4 

Setting T^ v = and searching for constant Ricci scalar solutions gives, 

4 i 4 /o 

(1+^)^-^(1-^)^ = <=► R„ v = ±^fg^ . (117) 

The plus sign corresponds to acceleration. 

In addition to proposing the model, Carroll, Duvvuri, Trodden and Turner 
|53j also showed that it suffers from a very weak tachyonic instability in the 
absence of matter. Because the only new higher derivative degree of freedom 
resides in the Ricci scalar, we may as well derive an equation for it alone from 
the trace of (|116|l . 

-* + £ + n(3£)-o. (ns, 

Now perturb about the accelerated solution, 

R=+V3fJ? +SR => -26 R- -^L- \3SR + 0{SR 2 ) = . (119) 

By comparing the linearized equation for 5R with that of a positive mass- 
squared scalar, 

(□-m 2 )¥> = 0, (120) 



4 Although extensions involving R^ v and R pc rA " ' R pa M „ have also been studied 
)55l . they must be ruled out on account of the Ostrogradskian instability. 
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we see that 8R behaves like a tachyon with m 2 = ~^/3/i 2 . However, because 
explaining the current phase of acceleration requires /i ~ 10~ 33 eV, the re- 
sulting instability is not very serious. I should note that the existence of a 
tachyonic instability in no way contradicts the Ostrogradskian analysis that 
this model's higher derivative degree of freedom carries positive kinetic energy. 



5.1 Inside Matter 



Dolgov and Kawasaki [57| showed that a radically different result emerges 
when this model is considered inside a static distribution of matter, 

T ^ = P s l$° with SttGp = M 2 -»fi 2 . (121) 

In that case the trace of (111611 gives, 

-i?+^ + n(|J)=-M 2 . (122) 

As might be expected, the static Ricci scalar solution in this case is dominated 
by M rather than /i, 

1 



R = -^M 2 + ^/W+Uf?j ~ M 2 . (123) 



Perturbing about this solution gives, 

R = R + SR -5R- ^£-6R- ^USR + 0(5R 2 ) = . (124) 

R R Q 

Comparing with the reference scalar (|120|l now reveals an enormous tachyonic 
mass, 

2 6ju 4 6^ V ' 

Plugging in the numbers for the density of water (p ~ 10 3 kg/m 3 ) gives M ~ 
10~ 18 eV, implying a tachyonic mass of magnitude |m| ~ 10 12 eV = 10 3 GeV! 

As disastrous as this problem might seem, Dick and Nojiri and 
Odintsov |59| have shown that it can be avoided by changing the model 
slightly, 

Because an R 2 term has global conformal invariance, it makes no contribution 
to the trace for constant R. Hence the cosmological solution of R — +y/3/j 2 
is not affected, nor is the static solution inside the matter distribution (|121fl . 
However, the equation for linearized perturbations inside matter changes to, 
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5R-%-SR + 3(-^- + ^)U6R = 0. (127) 



R 2 o V R* ^ 

The instability of Dolgov and Kawasaki was driven by the smallness of 2/i 4 / Rq . 
By simply taking a positive and of order one the tachyon becomes a positive 
mass-squared particle of m 2 ~ n 2 /a. 

5.2 Outside Matter 

Marc Soussa and I analyzed force of gravity outside a matter distribution |f>()| . 
Although our analysis was for the original f(R) = —/j, 4 /R model, there would 
be only slight differences for the extended model (|126fl . So our result seems 
to foreclose this possibility, but see |ST). 

The tachyonic instability could be studied using the perturbed Ricci scalar, 
but the gravitational force requires use of the metric. We perturbed about the 
de Sitter solution with Hubble constant H = /i/(48)~ in co-moving coordi- 
nates, 

ds 2 = -(l-h m )dt 2 + 2a(t)h 0l dtdx l +a 2 (t)(S lj +h. lJ )dx i dx j with a(t) = e Ht . 

(128) 

In the gauge, 

V"-^ + 3 VM* = o. ( 129 ) 

with h = —hoo + ha, the perturbed Ricci scalar takes the form, 

SR= -^d 2 h + 2Hd a h . (130) 

Our strategy was first to solve the de Sitter invariant equation for the per- 
turbed Ricci scalar, then reconstruct the gauge-fixed metric. 
We assumed a matter density of the form, 

p{t,x) = ^Le(R g -a{t)\x\) . (131) 

The exterior field equation has a simple expression in terms of the coordinate 
y = a(t)H\x\, 



( ^ ) dv 2 ~^ v ( ^ ) dv 



8R = . (132) 



dy 2 y\ J dy 

The solution takes the form, 

SR = Pifo(v)+lhf-i(v) , (133) 



where fo and /_i are hypergeometric functions whose series expansions are, 
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fo(y) = 1 - 2y 2 + ly 4 + . . . , (134) 
/-i(2/)-^(l-7y 2 + yy 4 + ...) . (135) 

We only need the behavior for small y because y = 1 is the Hubble radius! 
Matching to the source at y = HR g determines the combination coefficients 
to be, 

fa --#r , fa * -12GMH 3 . (136) 
K g 

This last step might seem bogus because we needed to regard the mass density 
as a small perturbation on the cosmological energy density /i 4 , whereas the 
opposite would be the case for galaxies or clusters of galaxies. However, this 
will only make changes of order one in the /Vs. I n particular, the asymptotic 
solution must still take the form (|133|l . 

The next step is solving for the trace of the perturbed metric. It turns out 
that relation (|130|) can also be expressed very simply using the variable y, 



J dy y\ 



h '(y) = Ja«* • (137) 



We only need to solve for the derivative of h because that is what gives the 
gravitational force in the geodesic equation. The solution is, 

h\y) = -j^y + 0(y*). (138) 

This should be compared to the general relativistic prediction, 

, , , AGMH , % b! 1 /IIjcIK 3 , , 

h 'oK(y) = — + =► j^ = 2\r) ■ (139) 

One consequence is that the force between the Milky Way and Andromeda 
galaxies would be about a million times larger than predicted by general 
relativity! 



6 Conclusions 

The potential of a quintessence scalar can be chosen to support any cosmology, 
but the epicyclic nature of this construction suggests we consider modifications 
of gravity. Ostrogradski's theorem JHJ limits local modifications of gravity 
to just algebraic functions of the Ricci scalar. Models of this form can give 
a late phase of cosmic acceleration such as we are currently experiencing. 
However, they can be tuned to give anything else as well. They seem every 
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bit as epicyclic as scalar quintessence. Further, the f(r) = — /i 4 /i? model is 
problematic, both inside and outside matter sources. 5 

An interesting and largely overlooked possibility for modifying gravity is 
the fully nonlocal effective action that results from quantum gravitational 
corrections. In weak field perturbation theory it has long been known that 
the most cosmologically significant one loop corrections are not of the R 2 
form usually studied but rather of the form R\n(C)R [SHI- More potentially 
interesting is the possibility of very strong infrared effects from the epoch of 
primordial inflation |64l I65| . 

It can be shown that quantum gravitational corrections to the inflation- 
ary expansion rate grow with time like powers of ln(et). Although suppressed 
by very small coupling constants, the exponential growth in a(t) during in- 
flation must eventually cause the effect to become nonperturbatively strong 

KT7| . Similar secular growth occurs as well for minimally coupled scalar 
field theories |681 lrJ§| , in which context Starobinskh has developed a tech- 
nique for summing the leading powers of ln(a) at each loop order [701 ITT] . 
If Starobinskh's technique can be generalized to quantum gravity jZ2J E3] it 
might result in a nonlocal effective gravity theory for late time cosmology in 
which a large, bare cosmological constant is almost completely screened by 
a nonperturbative quantum gravitational effect. In such a formalism the cur- 
rent phase of acceleration might result from a very slight mismatch between 
the bare cosmological constant and the quantum effect which screens it. It is 
even conceivable that one could reproduce the phenomenological successes of 
MOND E] with such a nonlocal metric theory, although it would have to 
unstable against decay into galaxy-scale gravitational waves [71] . 
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